Thioalkalivibrio sp. K90mix is an obligately chemolithoautotrophic, natronophilic
sulfur-oxidizing bacterium (SOxB) belonging to the family Ectothiorhodospiraceae within the
Gammaproteobacteria. The strain was isolated from a mixture of sediment samples obtained
from different soda lakes located in the Kulunda Steppe (Altai, Russia), using its extreme
potassium carbonate tolerance as the enrichment criterion. Here we report the complete
genome sequence of strain K90mix and its annotation. The genome was sequenced within the
Joint Genome Institute Community Sequencing Program because of its relevance to the
sustainable removal of sulfide from wastewater and gas streams.



Introduction 

Thioalkalivibrio sp. K90mix is an obligately chemo- 
lithoautotrophic SOxB using CO2 as a carbon 
source and reduced inorganic sulfur compounds 
as an energy source. It belongs to the genus 
Thioalkalivibrio. This genus represents a dominant 
SOxB type in soda lakes - extremely alkaline and 
saline habitats - and is the first example of an obli- 
gate chemolithoautotroph capable of growing in 
saturated sodium carbonate brines. It forms a 
monophyletic group within the family Ectothior- 
hodospiraceae of the Gammaproteobacteria. The 
genus currently includes nine validly published 
species [1] and around 70 as yet uncharacterized
strains that are extremely salt-tolerant and genet- 
ically different from the characterized isolates re- 
covered from hypersaline soda lakes [2-4]. The 
members are slow growing obligate autotrophs, 
well adapted to hypersaline (up to salt saturation) 
and alkaline (up to pH 10.5) conditions. Members 
of the genus Thioalkalivibrio have versatile meta- 
bolic capabilities, including oxidation of reduced 



sulfur compounds [1,2], denitrification [5,6] and 
thiocyanate utilization [7,8]. 

Apart from playing an important role in the sulfur
cycle of soda lakes, Thioalkalivibrio species are also
being used for the sustainable removal of sulfide
from wastewater and gas streams [9,10]. In this
process, hydrogen sulfide is absorbed into a high-salt
alkaline solution, which is subsequently transferred
to a bioreactor in which Thioalkalivibrio spp.
oxidize HS- to elemental sulfur. The produced
biosulfur can then be used as a fertilizer or
fungicide [9].

To gain a comprehensive understanding of the molecular
mechanisms by which Thioalkalivibrio sp. K90mix
oxidizes sulfur compounds and adapts to extremely
alkaline (up to pH 10.5) and hypersaline conditions
(up to 4 M Na+ or 3.6 M K+), it is necessary to
identify the genes that are involved in these
adaptations. The most important issues are the
mechanism of sulfide oxidation, carbon assimilation
at high pH, and bioenergetic adaptation to high salt
and high pH. Here we present a summary classification
and a set of features for Thioalkalivibrio sp. K90mix
together with the description of the genomic
sequencing and annotation.

Classification and features 

Because of the limited solubility of sodium carbonates
in the biodesulfurization process, we made a series
of enrichment cultures with an increasing ratio of
potassium to sodium carbonate (potassium carbonates
are 2-5 times more soluble than sodium carbonates).
Thioalkalivibrio sp. K90mix was isolated from a
culture that was inoculated with a mixture of sediment
samples from different hypersaline soda lakes and was
grown at the maximal possible replacement of sodium
by potassium, 3.6 M K+/0.4 M Na+ (90% substitution).

Thioalkalivibrio sp. K90mix has rod-shaped cells
with a polar flagellum (Figure 1) that elongate at
high concentrations of K+ (Figure 1b). The strain is
obligately alkaliphilic with a pH optimum of 10
(Table 1). It tolerates salinities up to 4.0 M total
Na+ (optimum 0.3 M), sulfide concentrations up to
1 mM and temperatures up to 40°C. It prefers carbonate
and sulfate over chloride as counter-anions and should
therefore be called "natronophilic" rather than
"haloalkaliphilic". It utilizes ammonia, nitrate and
nitrite as nitrogen sources. On the basis of 16S rRNA
gene sequencing, the strain belongs to the genus
Thioalkalivibrio within the Gammaproteobacteria, with
T. thiocyanoxidans and T. nitratis as the closest
described species (Figure 2). Most of the as yet
undescribed Thioalkalivibrio isolates from hypersaline
lakes of Siberia and Mongolia also belong to this
core genetic cluster of the genus Thioalkalivibrio.

Genome sequencing information 

Genome project history 

Strain K90mix was selected for sequencing in the 
2007 Joint Genome Institute Community Sequenc- 
ing Program, because of its relevance to biore- 
mediation. A summary of the project information 
is presented in Table 2. The complete genome se- 
quence was finished in February 2010. The Gen- 
Bank accession numbers are NC_013889 and 
NC_013930 for the chromosome and plasmid, re- 
spectively. The genome project is listed in the Ge- 
nome OnLine Database (GOLD) [26] as project 
Gc01217. Sequencing was carried out at the Joint
Genome Institute (JGI). Finishing was done by
JGI-Los Alamos National Laboratory (LANL) and initial
automatic annotation by JGI-Oak Ridge National
Laboratory (ORNL).

Growth conditions and DNA isolation 

Thioalkalivibrio sp. K90mix was grown with 40 mM
thiosulfate as an energy source in standard sodium
carbonate-bicarbonate medium at pH 10 and 2 M Na+ [2]
at 35°C with shaking at 200 rpm. The cells were
harvested by centrifugation and stored at -80°C for
DNA extraction. Genomic DNA was obtained by
phenol-chloroform-isoamyl alcohol (PCI) extraction and
precipitated with ethanol. The pellet was dried under
vacuum and subsequently dissolved in water. The
quality and quantity of the extracted DNA were
evaluated using the DNA Mass Standard Kit provided by
the JGI.




Figure 1. Phase contrast micrographs of the cell morphology of Thioalkalivibrio sp. K90mix grown at pH 10
and 4 M Na+ (a), or 3.6 M K+/0.4 M Na+ (b).

Table 1. Classification and general features of Thioalkalivibrio sp. K90mix

MIGS ID   Property                Term                                                Evidence code
          Current classification  Domain Bacteria                                     TAS [12]
                                  Phylum Proteobacteria                               TAS [13]
                                  Class Gammaproteobacteria                           TAS [14,15]
                                  Order Chromatiales                                  TAS [16]
                                  Family Ectothiorhodospiraceae                       TAS [17]
                                  Genus Thioalkalivibrio                              TAS [18-21]
                                  Species Thioalkalivibrio sp. K90mix                 NAS
          Gram stain              negative                                            TAS [2]
          Cell shape              rod-shaped                                          TAS [2]
          Motility                motile                                              TAS [2]
          Sporulation             non-sporulating                                     TAS [2]
          Temperature range       Mesophile; maximum at 41°C                          TAS [2]
          Optimum temperature     34°C                                                TAS [2]
MIGS-6.3  Salinity range          0.2-4.0 M Na+ (opt. 0.3 M)                          TAS [2]
MIGS-22   Oxygen requirement      aerobic                                             TAS [2]
          Carbon source           HCO3-                                               TAS [2]
          Energy source           Sulfide/polysulfide, thiosulfate, sulfur, sulfite   TAS [2]
MIGS-6    Habitat                 soda lakes                                          TAS [2]
MIGS-15   Biotic relationship     free-living                                         TAS [2]
MIGS-14   Pathogenicity           none                                                NAS
          Biosafety level         1                                                   TAS [22]
          Isolation               Soda lake sediments                                 TAS [4]
MIGS-4    Geographic location     Kulunda Steppe, Altai, Russia                       TAS [4]
MIGS-5    Sample collection time  2005                                                NAS
MIGS-4.1  Latitude                [illegible]                                         [illegible]
MIGS-4.2  Longitude               79.48                                               TAS [23]
MIGS-4.3  Depth                   Not applicable
MIGS-4.4  Altitude                Sea level                                           NAS
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a
direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living,
isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence
codes are from the Gene Ontology project. If the evidence code is IDA, then the property was directly observed
by one of the authors or an expert mentioned in the acknowledgements.

[Figure 2: phylogenetic tree. Leaf labels in the order shown, with sequence accession numbers; T marks a type strain:]
Thioalkalivibrio sp. K90mix (CP001906)
Thioalkalivibrio thiocyanoxidans ARh2 T (AF302081)
Thioalkalivibrio nitratis ALJ12 T (AF126547)
Thioalkalivibrio jannaschii ALM2 T (AF329083)
Thioalkalivibrio versutus AL2 T (AF126546)
Thioalkalivibrio halophilus HL17 T (AY079010)
Thioalkalivibrio nitratireducens ALEN2 T (AY079010)
Thioalkalivibrio paradoxus ARh1 T (AF151432)
"Thioalkalivibrio sulfidophilus" HL-EbGr7 (CP001339)
Thioalkalivibrio sp. AU17 (CU735094)
Thioalkalivibrio denitrificans ALJD T (AF126545)
Thioalkalivibrio thiocyanodenitrificans ARhD T (AY360060)
Thiorhodospira sibirica ATCC 700588 T (AJ006530)
Ectothiorhodosinus mongolicum M9 T (AY298904)
Ectothiorhodospira shaposhnikovii DSM 243 T (M59151)
Scale bar: 0.01

Figure 2. Phylogenetic tree based on 16S rRNA sequences showing the phylogenetic position of Thioalkalivibrio sp. 
K90mix. The sequence was aligned to sequences stored in the SILVA database using the SINA Webaligner [24]. 
Subsequently, the aligned sequences were imported into ARB [25], and a neighbor joining tree was constructed. 
Sequences of members from the Alphaproteobacteria were used as outgroup, but were pruned from the tree. The 
scale bar indicates 1% sequence difference. 
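
The scale bar of such a tree expresses sequence divergence between aligned sequences. As a minimal sketch, assuming the simplest uncorrected measure (the p-distance, i.e. the fraction of compared alignment positions that differ), the quantity behind a "1% sequence difference" bar can be illustrated as follows. The function name and the toy sequences are hypothetical, and the distance correction actually applied in ARB is not stated in the text.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected p-distance between two aligned sequences.

    Columns where either sequence has a gap ('-') are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore gap columns
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return differing / compared


# Hypothetical toy alignment of 100 positions (not real 16S rRNA data).
taxon_a = "ACGU" * 25
taxon_b = taxon_a[:-1] + "A"  # one substitution in 100 positions

print(f"p-distance: {p_distance(taxon_a, taxon_b):.2%}")
```

A full neighbor-joining reconstruction would start from a matrix of such pairwise distances over all taxa in the alignment.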



Table 2. Genome sequencing project information

MIGS ID    Characteristic              Details
MIGS-28    Libraries used              6 kb and 40 kb Sanger and 454 standard libraries
MIGS-29    Sequencing platform         ABI-3730, 454 GS FLX Titanium
MIGS-31.2  Sequencing coverage         10.0x Sanger, 32.1x pyrosequence
MIGS-31    Finishing quality           Finished
           Sequencing quality          Less than one error per 100 kb
MIGS-30    Assembler                   Newbler, PGA
MIGS-32    Gene calling method         Prodigal, GenePRIMP
           GenBank ID                  NC_013889
           GenBank date of release     February 21, 2010
           GOLD ID                     Gc01217
           NCBI project ID             30759
           IMG Taxon ID                646564584
MIGS-13    Source material identifier  Personal culture collection, Winogradsky Institute of Microbiology, Moscow
           Project relevance           Bioremediation

Genome sequencing and assembly

The genome of Thioalkalivibrio sp. K90mix was
sequenced using a combination of Sanger and 454
sequencing platforms. All general aspects of library
construction and sequencing can be found at the JGI
website [27]. Pyrosequencing reads were assembled
using the Newbler assembler version 1.1.02.15
(Roche). Large Newbler contigs were broken into 3,292
overlapping fragments of 1,000 bp and entered into
the assembly as pseudo-reads. The sequences were
assigned quality scores based on Newbler consensus
q-scores with modifications to account for overlap
redundancy and to adjust inflated q-scores. A hybrid
454/Sanger assembly was made using the PGA assembler.
Possible mis-assemblies were corrected and gaps
between contigs were closed by editing in Consed and
by custom primer walks from sub-clones or PCR
products. A total of 181 Sanger finishing reads were
produced to close gaps, to resolve repetitive regions,
and to raise the quality of the finished sequence.
Illumina reads were used to improve the final
consensus quality using an in-house developed tool
(the 'Polisher' [28]). The error rate of the completed
genome sequence is less than 1 in 100,000. Together,
the combination of the Sanger and 454 sequencing
platforms provided 42.1x coverage of the genome. The
final assembly contains 28,443 Sanger reads (10.0x)
and 419,015 pyrosequencing reads (32.1x).
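
The coverage figures above can be cross-checked with simple arithmetic. The sketch below assumes that fold coverage is defined as total read bases divided by genome size (the text does not state the exact definition) and uses the genome size from Table 3. The mean read lengths it derives are back-calculated illustrations, not values reported by the authors, and all function and variable names are hypothetical.

```python
GENOME_SIZE_BP = 2_985_056  # total genome size (Table 3)


def fold_coverage(total_read_bases: float, genome_size_bp: int = GENOME_SIZE_BP) -> float:
    """Fold coverage, assumed here to be read bases divided by genome size."""
    return total_read_bases / genome_size_bp


def implied_mean_read_length(n_reads: int, coverage: float,
                             genome_size_bp: int = GENOME_SIZE_BP) -> float:
    """Mean read length (bp) implied by a read count and a fold coverage."""
    return coverage * genome_size_bp / n_reads


# Read counts and per-platform coverage as given in the text.
sanger_reads, sanger_cov = 28_443, 10.0
pyro_reads, pyro_cov = 419_015, 32.1

combined_cov = sanger_cov + pyro_cov
sanger_len = implied_mean_read_length(sanger_reads, sanger_cov)
pyro_len = implied_mean_read_length(pyro_reads, pyro_cov)

print(f"combined coverage: {combined_cov:.1f}x")
print(f"implied mean Sanger read length: {sanger_len:.0f} bp")
print(f"implied mean pyrosequencing read length: {pyro_len:.0f} bp")
```

Under this assumption the two per-platform values add up to the combined 42.1x stated in the text.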

Genome annotation 

Genes were identified using Prodigal [29] as part 
of the Oak Ridge National Laboratory genome an- 
notation pipeline followed by a round of manual 
curation using the JGI GenePRIMP pipeline [30]. 
The predicted CDSs were translated and used to
search the National Center for Biotechnology
Information (NCBI) nonredundant database and the
UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and
InterPro databases. Additional gene prediction
analysis and functional annotation were performed
within the Integrated Microbial Genomes Expert
Review (IMG-ER) platform [31].

Genome properties 

The genome of strain K90mix consists of a circular 
chromosome with a size of 2.74 Mbp (Figure 3) 
and a linear plasmid of 240 Kbp. The G+C percen- 
tage determined from the genome sequence is 
65.54%, which is similar to the value determined 
by thermal denaturation (65.8±0.5 mol%). There 
are 2942 genes of which 2888 are protein-coding 



genes and the remaining 54 are RNA genes. 33 
pseudogenes were identified, constituting 1.12% 
of the total number of genes. The genome is small- 
er than that of "Thioalkalivibrio sulfidophilus" HL- 
EbGr7 [32], 2.98 Mbp versus 3.46 Mbp, but has a 
similar percentage of protein-coding genes 
(98.16% versus 98.06%). The properties and sta- 
tistics of the genome are summarized in Table 3, 
and genes belonging to COG functional categories 
are listed in Table 4. 
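
The percentages quoted in this section follow directly from the counts in Table 3. The short check below recomputes them; it assumes that base-pair rows are expressed relative to the genome size and gene rows relative to the total gene count. The helper name and the dictionary layout are hypothetical.

```python
GENOME_SIZE_BP = 2_985_056
TOTAL_GENES = 2_942


def percent(part: float, whole: float) -> float:
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


# (count, reported percentage) pairs taken from Table 3.
bp_rows = {
    "DNA coding region": (2_673_521, 89.56),
    "DNA G+C content": (1_956_383, 65.54),
}
gene_rows = {
    "RNA genes": (54, 1.84),
    "Protein-coding genes": (2_888, 98.16),
    "Pseudogenes": (33, 1.12),
    "Genes in paralog clusters": (267, 9.08),
    "Genes assigned to COGs": (2_139, 72.71),
    "Genes assigned Pfam domains": (2_292, 77.91),
    "Genes with signal peptides": (934, 31.75),
}

recomputed = {name: percent(count, GENOME_SIZE_BP) for name, (count, _) in bp_rows.items()}
recomputed.update({name: percent(count, TOTAL_GENES) for name, (count, _) in gene_rows.items()})

for name, value in recomputed.items():
    print(f"{name}: {value:.2f}%")

# Chromosome plus plasmid, in Mbp, as quoted in the text.
total_mbp = round(2.74 + 0.24, 2)
```

The same check confirms that the protein-coding and RNA genes together account for all 2,942 genes, and that the chromosome and plasmid sizes sum to the 2.98 Mbp total.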

Insights from the genome sequence 
Autotrophic growth 

As mentioned before [32], autotrophic growth at
extremely high pH is a problem, because inorganic
carbon is mainly present as carbonate (with
bicarbonate as a minor fraction) at pH values above
10. This would demand active transport of bicarbonate
into the cell. We found a gene related to stbA, which
encodes a Na+/HCO3- symporter in the marine
cyanobacterium Synechocystis sp. strain PCC 6803 [33].
Figure 4 shows a phylogenetic tree of different
sequences related to StbA and the hydrophobicity
profiles of StbA of Synechocystis sp. PCC 6803 and
Thioalkalivibrio sp. K90mix. In addition, we found
genes for the large (TK90_0858) and small (TK90_0859)
subunits of RuBisCO form IAc, and for the synthesis of
α-carboxysomes (TK90_0860 - TK90_0866), including
csoSCA encoding a carboxysome shell alpha-type
carbonic anhydrase, which was also found in the
genomic analysis of "Thioalkalivibrio sulfidophilus"
HL-EbGr7 [32].
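
The pH dependence of inorganic carbon speciation can be illustrated with the standard equilibrium expressions for a diprotic acid. This is a sketch under stated assumptions: the default pK values (6.35 and 10.33) are textbook values for dilute water at 25°C and are not taken from the paper, and apparent constants in concentrated soda brines are expected to differ, so the numbers are illustrative only. The function name is hypothetical.

```python
def carbonate_fractions(ph: float, pk1: float = 6.35, pk2: float = 10.33) -> dict:
    """Mole fractions of dissolved CO2, HCO3- and CO3^2- at a given pH.

    pk1 and pk2 are ASSUMED dissociation constants (dilute water, 25 C);
    pass apparent values for saline media if they are known.
    """
    h = 10.0 ** (-ph)
    k1 = 10.0 ** (-pk1)
    k2 = 10.0 ** (-pk2)
    denom = h * h + k1 * h + k1 * k2
    return {
        "CO2": h * h / denom,
        "HCO3-": k1 * h / denom,
        "CO3^2-": k1 * k2 / denom,
    }


for ph in (8.0, 10.0, 10.5, 11.0):
    fractions = carbonate_fractions(ph)
    summary = ", ".join(f"{name} {value:.4f}" for name, value in fractions.items())
    print(f"pH {ph}: {summary}")
```

With these constants, dissolved CO2 is a negligible fraction at pH 10 and above, and carbonate overtakes bicarbonate once the pH exceeds the second pK. This is in line with the argument that the cell depends on bicarbonate uptake rather than CO2 diffusion.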

Sulfur metabolism 

Thioalkalivibrio sp. K90mix can oxidize sulfide/polysulfide,
thiosulfate, sulfite (in vitro) and elemental sulfur to sulfate.
Elemental sulfur is formed as an intermediate during sulfide and
thiosulfate oxidation at oxygen limitation and near-neutral pH.
Figure 5 shows a schematic overview of the different genes that are
involved in the oxidation of sulfur compounds. The genome of
Thioalkalivibrio sp. K90mix contains genes for flavocytochrome
c/sulfide dehydrogenase (TK90_0236), which oxidizes sulfide to
elemental sulfur. It contains an incomplete set of sox genes
including soxYZ (TK90_0123 and TK90_0124), soxAX (TK90_0432 and
TK90_0433) and two copies of soxB (TK90_0627 and TK90_1150), but
lacks soxCD. Such an incomplete Sox system would allow oxidation of
the sulfane atom of thiosulfate to the state of elemental sulfur, but
no further. However, the genome does not contain the reverse
dissimilatory sulfite reduction pathway to oxidize sulfur to sulfite,
which has been found in the genome of "Thioalkalivibrio
sulfidophilus" HL-EbGr7 [32]. Absence of dsr genes has also been
found for the green sulfur bacterium Chloroherpeton thalassium, which
can oxidize sulfide to elemental sulfur but subsequently oxidizes the
produced sulfur only very slowly [34], probably due to the absence of
dsr. Frigaard and Dahl [35] suggested that a RuBisCO-like protein
(RLP) might be involved in sulfur oxidation [36]. Genes encoding the
RuBisCO-like protein were not found, nor were genes encoding sulfur
dioxygenase or sulfur oxygenase-reductase, which can oxidize or
disproportionate sulfur in several acidophilic bacteria and archaea
[37]. However, we found a gene cluster encoding two sulfur
transferases (rhd, TK90_0630; sirA, TK90_0631) and a heterodisulfide
reductase complex (TK90_0632 - TK90_0637) consisting of hdrA, hdrB,
and hdrC (Figure 6). dsrE was missing in this cluster, but was
present at 3 other places in the genome (TK90_0511, TK90_0639,
TK90_1244).

Figure 3. Graphical circular map of the chromosome of Thioalkalivibrio sp. strain K90mix. From
outside to the center: genes on the forward strand (colored by COG categories), genes on the reverse
strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC
content, GC skew.
Table 3. Genome statistics

Attribute                             Value       % of Total
Genome size (bp)                      2,985,056   100.00%
DNA coding region (bp)                2,673,521   89.56%
DNA G+C content (bp)                  1,956,383   65.54%
Number of replicons                   1
Extrachromosomal elements (plasmid)   1
Total genes                           2,942       100.00%
RNA genes                             54          1.84%
rRNA operons                          3           0.10%
Protein-coding genes                  2,888       98.16%
Pseudogenes                           33          1.12%
Genes in paralog clusters             267         9.08%
Genes assigned to COGs                2,139       72.71%
Genes assigned Pfam domains           2,292       77.91%
Genes with signal peptides            934         31.75%
CRISPR repeats                        2


Table 4. Number of genes associated with the general COG functional categories.

Code  Value  %age   Description
J     156    6.57   Translation, ribosomal structure and biogenesis
A     1      0.04   RNA processing and modification
K     86     3.62   Transcription
L     164    6.91   Replication, recombination and repair
B     2      0.08   Chromatin structure and dynamics
D     41     1.73   Cell cycle control, mitosis and meiosis
Y     0      0.0    Nuclear structure
V     35     1.47   Defense mechanisms
T     166    6.99   Signal transduction mechanisms
M     176    7.41   Cell wall/membrane biogenesis
N     94     3.96   Cell motility
Z     0      0.0    Cytoskeleton
W     0      0.0    Extracellular structures
U     97     4.09   Intracellular trafficking and secretion
O     110    4.63   Posttranslational modification, protein turnover, chaperones
C     146    6.15   Energy production and conversion
G     79     3.33   Carbohydrate transport and metabolism
E     155    6.53   Amino acid transport and metabolism
F     66     2.78   Nucleotide transport and metabolism
H     146    6.15   Coenzyme transport and metabolism
I     65     2.74   Lipid transport and metabolism
P     119    5.01   Inorganic ion transport and metabolism
Q     42     1.77   Secondary metabolites biosynthesis, transport and catabolism
R     240    10.11  General function prediction only
S     188    7.92   Function unknown
-     803    27.29  Not in COGs

Figure 4. Phylogeny and hydropathy of StbA from different microorganisms. The blue boxes indicate sequences
belonging to the cyanobacteria. The dots on the branches are bootstrap values between 50% and 75% (open
dots) or between 75% and 100% (closed dots). The scale bar indicates 10% sequence difference. The sequence
of Thioalkalivibrio sp. K90mix is indicated in bold. Hydropathy profiles were made for the StbA protein
sequences of Synechocystis sp. PCC6803 and Thioalkalivibrio sp. K90mix to indicate transmembrane spanning
domains.



The Hdr complex plays a role in the energy metabolism of methanogens
[38] and sulfate-reducing prokaryotes [39]. In methanogens, the
enzyme complex catalyzes the reversible reduction of the disulfide
(CoM-S-S-CoB) of the two methanogenic thiol-coenzymes, coenzyme M
(CoM-SH) and coenzyme B (CoB-SH); in sulfate-reducing microorganisms
the substrate (X-S-S-X) is not known. Recently, the genes encoding
the Hdr complex have also been detected in the genomes of the
acidophilic sulfur-oxidizing bacteria Acidithiobacillus ferrooxidans
[40] and Acidithiobacillus caldus [41]. Quatrini and co-workers [40]
hypothesized that Hdr, like the dissimilatory sulfite reductase
(dsr), is working in reverse, whereby sulfur (i.e., the sulfane atom)
is oxidized to sulfite, and electrons are donated to the quinone
pool. The role of the Hdr complex in sulfur metabolism still has to
be confirmed by expression analysis, and hdr genes have not yet been
detected in the genomes of the neutrophilic chemolithotrophic
sulfur-oxidizing bacteria Thiomicrospira crunogena and Thiobacillus
denitrificans or of the phototrophic sulfur-oxidizing bacteria
Allochromatium vinosum and Halorhodospira halophila. The absence of a
reverse dsr pathway in Thioalkalivibrio sp. K90mix might be the
reason why it can only tolerate sulfide concentrations up to 1 mM.
Sulfite can be oxidized further to sulfate, either directly by
sulfite dehydrogenase (sorA; TK90_0686) or indirectly via
adenosine-5'-phosphosulfate (APS) by APS reductase encoded by aprBA
(TK90_0064 and TK90_0065) and ATP sulfurylase encoded by sat
(TK90_0062) [42]. All these genes are present in the investigated
genome.
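
The proposed sulfur oxidation route can be summarized as a small reaction graph. The sketch below encodes the steps named in the text, marks the Hdr step as hypothesized (its role is unproven), and searches for a route from sulfide to sulfate. The data structure, status labels and function name are illustrative assumptions rather than part of the annotation, and only the sulfane branch of thiosulfate oxidation is represented.

```python
from collections import deque

# (substrate, product, genes, status) for each step described in the text.
PATHWAY = [
    ("HS-", "S0", "fccAB", "annotated"),
    ("S2O3^2-", "S0", "soxAX/soxYZ/soxB", "annotated"),  # sulfane atom only; soxCD absent
    ("S0", "SO3^2-", "hdrABC", "hypothesized"),           # reverse Hdr, not yet proven
    ("SO3^2-", "SO4^2-", "sorA", "annotated"),
    ("SO3^2-", "APS", "aprBA", "annotated"),
    ("APS", "SO4^2-", "sat", "annotated"),
]


def find_route(start: str, goal: str, allow_hypothesized: bool = True):
    """Breadth-first search for the shortest chain of steps from start to goal.

    Returns a list of (substrate, product, genes) tuples, or None if no route exists.
    """
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        compound, steps = queue.popleft()
        if compound == goal:
            return steps
        for substrate, product, genes, status in PATHWAY:
            if substrate != compound or product in seen:
                continue
            if status == "hypothesized" and not allow_hypothesized:
                continue
            seen.add(product)
            queue.append((product, steps + [(substrate, product, genes)]))
    return None


for substrate, product, genes in find_route("HS-", "SO4^2-"):
    print(f"{substrate} -> {product} ({genes})")
```

Calling `find_route("HS-", "SO4^2-", allow_hypothesized=False)` returns `None`, which reflects the point made above: without a reverse dsr pathway, complete oxidation of sulfide to sulfate in this model depends on the still unconfirmed Hdr step.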

Energy metabolism and pH homeostasis 

At this time, it is not clear how Thioalkalivibrio sp. K90mix can
withstand the harsh conditions of high pH and salinity. The
difference between the pH of the environment (pH 10) and the pH in
the cell (pH 8) causes a reversed ΔpH and consequently lowers the
proton motive force (PMF). Therefore, Thioalkalivibrio requires
special molecular mechanisms to obtain enough energy for growth. It
certainly needs this energy, because the production of osmolytes to
withstand the high concentrations of salts costs 55 molecules of ATP
for one molecule of glycine betaine, and 110 molecules of ATP for one
molecule of sucrose [43]. In addition, the chemolithoautotrophic
lifestyle with CO2 fixation is energetically very expensive. The
redox potential of the substrate couple S0/HS- (-260 mV) is more
positive than the potential of NAD+/NADH (-340 mV), and therefore the
direct reduction of NAD+ to supply reducing equivalents for CO2
fixation is not possible. Reverse electron transport is necessary to
produce enough NADH for CO2 fixation, which costs extra energy. In
addition, because of the large pH gradient over the cell membrane,
Thioalkalivibrio needs special mechanisms to keep the intracellular
pH around neutral (pH homeostasis), which again is an
energy-requiring process.
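
Two of the quantitative arguments above can be made explicit with textbook relations. The sketch assumes ΔG = -nFΔE for the electron transfer from the S0/HS- couple to NAD+ with n = 2 electrons, and the usual PMF convention Δp = Δψ - (2.303RT/F)·ΔpH with ΔpH = pH(in) - pH(out). The temperature of 25°C, the two-electron assumption and the function names are illustrative choices, not values from the paper.

```python
FARADAY = 96485.0  # C/mol
GAS_CONSTANT = 8.314  # J/(mol*K)


def delta_g_kj_per_mol(e_donor_mv: float, e_acceptor_mv: float, n_electrons: int = 2) -> float:
    """Gibbs energy change (kJ/mol) for moving n electrons from donor to acceptor couple."""
    delta_e_volts = (e_acceptor_mv - e_donor_mv) / 1000.0
    return -n_electrons * FARADAY * delta_e_volts / 1000.0


def ph_term_mv(ph_in: float, ph_out: float, temp_c: float = 25.0) -> float:
    """Contribution of the pH gradient to the PMF in mV: -(2.303RT/F) * (pH_in - pH_out).

    A positive value opposes an inside-negative membrane potential.
    """
    z_mv = 2.303 * GAS_CONSTANT * (temp_c + 273.15) / FARADAY * 1000.0
    return -z_mv * (ph_in - ph_out)


# Direct reduction of NAD+ (-340 mV) by the S0/HS- couple (-260 mV).
dg = delta_g_kj_per_mol(e_donor_mv=-260.0, e_acceptor_mv=-340.0)
# Reversed pH gradient: cytoplasm at pH 8, environment at pH 10.
ph_term = ph_term_mv(ph_in=8.0, ph_out=10.0)

print(f"delta G for direct NAD+ reduction: {dg:+.1f} kJ/mol")
print(f"pH-gradient term of the PMF: {ph_term:+.0f} mV")
```

Under these assumptions the direct reduction of NAD+ is endergonic (positive ΔG), which is why reverse electron transport is required. The two-unit reversed ΔpH contributes a positive term of roughly 120 mV that works against the membrane potential.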



[Figure 5 diagram labels: HS-; flavocytochrome c/sulfide dehydrogenase (fccAB); reverse heterodisulfide
reductase complex (hdrABC); Sox system (soxABXYZ); S2O3 2-; APS reductase (aprBA); SO3 2-; APS;
ATP sulfurylase (sat); sulfite dehydrogenase (sorA); SO4 2-]

Figure 5. Schematic overview of the different genes that are involved
in the oxidation of sulfur compounds, although the role of the Hdr
complex has not been proven. The genes encoding the reverse
dissimilatory sulfite reductase (dsr), which are present in the genome of
Thioalkalivibrio sulfidophilus, are absent in the genome of
Thioalkalivibrio sp. K90mix.

Figure 6. Comparison of the hdr cluster of A. ferrooxidans ATCC 23270 (AF), Thioalkalivibrio sp.
K90mix (K90) and Thioalkalivibrio sulfidophilus HL-EbGr7 (HL). The heterodisulfide reductase complex
consists of the genes encoding HdrC1, HdrB1, HdrA, orf2, HdrC2 and HdrB2. The accessory proteins
are Rhd and TusA. DsrE was not found in this order, but was present at other places in the genome.
The percentage of amino-acid similarity is indicated.




The genome has revealed genes encoding proteins similar to those
found for "Thioalkalivibrio sulfidophilus" HL-EbGr7 [32]. We found
genes for a proton-driven F0F1-type ATP synthase (i.e., subunits A
(TK90_2593), B (TK90_2591), and C (TK90_2592) of the F0 subcomplex,
and subunits alpha (TK90_2589), beta (TK90_2587), gamma (TK90_2588),
delta (TK90_2590), and epsilon (TK90_2586) of the F1 subcomplex),
genes encoding the proton-translocating NADH dehydrogenase
(nuoABCDEFGHIJKLMN; TK90_0708 to TK90_0721), as well as the genes for
a putative primary sodium pump Rnf [44] (rnfABCDGE; TK90_1790 to
TK90_1795). In addition, we found several genes encoding different
secondary sodium-dependent pumps, such as the Na+/H+ antiporters NhaP
(TK90_1831) and Mrp (mnhA-G; TK90_0748 to TK90_0752), which according
to Padan et al. [45] both play an essential role in alkaline pH
homeostasis. We also found genes encoding transporters belonging to
the SulP family (TK90_0019, TK90_0897, TK90_0985). Transporters of
this group could be involved in low-affinity, high-flux bicarbonate
uptake [46]. Furthermore, genes encoding the sodium-dependent
flagellar motor PomA/B (TK90_1180 and TK90_1181) are present in the
genome (see below for more details). As Thioalkalivibrio sp. K90mix
can withstand high concentrations of potassium, we also searched for
K+ transporters and found genes encoding the following transporters:
TrkA-C (TK90_0502), TrkA-N (TK90_2266) and TrkH (TK90_2267), which
are part of the potassium uptake system [47].



Chemotaxis and motility 

We found several genes encoding methyl-accepting chemotaxis sensory
transducers (TK90_0580, TK90_0949, TK90_1402, TK90_2562, TK90_2397)
that are involved in chemotaxis. One of these genes, Aer (TK90_0580),
encodes a redox sensor involved in aerotaxis. In E. coli, Aer
regulates the motility behavior in gradients of oxygen, redox
potential and certain nutrients by interacting with the CheA-CheW
complex. We found genes encoding several different proteins of this
complex, CheA (TK90_1178), CheW (TK90_1183 and TK90_1184), CheY
(TK90_1176), CheZ (TK90_1177), CheB (TK90_1179), CheV (TK90_0924) and
CheR (TK90_0925). Chemotaxis consists of a complex cascade of
different reactions: the redox sensor Aer senses a difference in
redox potential induced by a change in the environmental oxygen
concentration, which leads to the autophosphorylation of the
histidine protein kinase CheA. CheA phosphorylates CheY, which will
switch on the flagellar motor (see [48] for a detailed overview).
CheW acts as an adaptor protein, while CheB, CheR, CheZ, and CheV are
involved in feedback regulation.

Thioalkalivibrio sp. K90mix has all the genes that are indispensable
for the production of flagellar proteins (FlgA, TK90_0923; FlgB,
TK90_0926; FlgC, TK90_0927; FlgD, TK90_0928; FlgE, TK90_0929; FlgF,
TK90_0930; FlgG, TK90_0931; FlgH, TK90_0932; FlgI, TK90_0933; FlgK,
TK90_0935; FlgL, TK90_0936; FlgM, TK90_0922; FlgN, TK90_0921; FliC,
TK90_1400, 1448, 1450; FliD, TK90_1447; FliE, TK90_1157; FliF,
TK90_1158; FliG, TK90_1159; FliH, TK90_1160; FliI, TK90_1161; FliJ,
TK90_1162; FliK, TK90_1163; FliM, TK90_1165; FliN, TK90_1166; FliO,
TK90_1167; FliP, TK90_1168; FliQ, TK90_1169; FliR, TK90_1170; FliS,
TK90_1446; FlhA, TK90_1172; FlhB, TK90_1171; see Table 1 in [49]).
Phylogenetic analysis of sequences encoding different flagellar
motors showed, with significant bootstrap values, genes encoding both
proton-driven motors (TK90_0578, TK90_0577) related to E. coli MotA
and MotB, as well as sodium-driven motors (TK90_1180, TK90_1181)
related to Vibrio cholerae PomA and PomB and Bacillus subtilis MotP
and MotS (Figure 7). The results also show the presence of both
proton- and sodium-driven flagellar motors in the
Ectothiorhodospiraceae "Thioalkalivibrio sulfidophilus",
Halorhodospira halophila, and Alkalilimnicola ehrlichei, as well as
in Allochromatium vinosum, Acidithiobacillus caldus, Vibrio
alginolyticus and Vibrio parahaemolyticus. Vibrio cholerae, Bacillus
subtilis, B. halodurans, and Nitrococcus mobilis only have
sodium-driven flagellar motors. Grouping of the proton-driven MotA
and sodium-driven PomA has also been found by Krulwich et al. [50].




Figure 7. Phylogenetic tree based on protein sequences of different flagellar motors. The sequences of
the proton-driven flagellar motor MotAB from E. coli (ECDH10B_2031, ECDH10B_2030) and of the
sodium-driven flagellar motor PomAB from V. cholerae (VC0892, VC0893) were used as reference
proteins. Other proteins were selected after BLAST analysis. Subsequently, the selected protein
sequences were aligned using Clustal W, and a neighbor joining tree was drawn using MEGA 5. Ac,
Acidithiobacillus caldus ATCC 51756; Ae, Alkalilimnicola ehrlichei MLHE-1; Av, Allochromatium vinosum
DSM 180; Bh, Bacillus halodurans C125; Bs, Bacillus subtilis strain 168; Ec, Escherichia coli DH10B;
Hh, Halorhodospira halophila SL1; Nm, Nitrococcus mobilis NB-231; Ts, Thioalkalivibrio sulfidophilus
HL-EbGr7; Tv-K90, Thioalkalivibrio sp. K90mix; Va, Vibrio alginolyticus 12G01; Vc, Vibrio cholerae O1;
Vp, Vibrio parahaemolyticus RIMD 2210633. The sequences of Thioalkalivibrio sp. K90mix are shown
in bold type. The bar indicates 20% sequence difference. Numbers on the branches indicate
percentage bootstrap values from 1000 iterations; only those values are shown that distinguish the
different flagellar motors. The CheW sequence of Thioalkalivibrio sp. K90mix (TK90_1183) was used as an
outgroup, but was pruned from the tree.

Transposases and environmental stress

Comparative analysis of the genomes of 
"Thioalkalivibrio sulfidophilus" HL-EbGr7 [32] and 
Thioalkalivibrio sp. K90mix showed a greater ab- 
undance of genes encoding different transposases 
(i.e., COG2801, COG3328, COG3547) in the latter. 
Transposases are enzymes that can move specific 
sequences of DNA, known as transposons or 
transposable elements, within the genome. Krul- 
wich [51] found that the genome of the alkaliphilic 
bacterium Bacillus halodurans C125 contained 
112 transposase genes as compared to 10 in the 
genome of its closest non-alkaliphilic relative B. 
subtilis. She suggested that this might be one of the 
mechanisms of alkaliphilic adaptation at the ge- 
nome level. Although strains HL-EbGr7 and 
K90mix are both obligately alkaliphilic, they differ 
in salt tolerance. HL-EbGr7 can tolerate only low 
(up to 1.5 M) salt concentrations, while K90mix 
can tolerate high (up to 4 M) salinities. 

Capy et al. [52] mentioned that environmental 
stress might stimulate transposition and conse- 
quently increase the genetic variability, which can 
be beneficial for the adaptation to novel environ- 
mental conditions. Foti et al. [4] used rep-PCR [53] 
to study the genetic diversity within the genus 
Thioalkalivibrio and found a relatively high diver- 
sity of 56 genotypes among 85 strains that were 
isolated from different soda lakes in Africa and 
Asia. In addition, preliminary enrichment experi- 
ments with potassium carbonate instead of so- 
dium carbonate and higher concentrations of 
chloride selected populations of high salt-tolerant 
Thioalkalivibrio strains with different rep-PCR 
patterns (unpublished results), which might be an 
indication that transposition might occur more 
frequently in strains with a wide range of salt to- 
lerance. 



Oxidative stress 

Reactive oxygen species (ROS), such as superoxide (O2-) and hydrogen
peroxide (H2O2), are naturally produced at hypersaline conditions and
are deleterious to cellular macromolecules. To protect themselves
from this oxidative stress, Thioalkalivibrio sp. K90mix and
"Thioalkalivibrio sulfidophilus" HL-EbGr7 have several defense
mechanisms. Superoxide is converted to oxygen and hydrogen peroxide
by the enzyme superoxide dismutase (TK90_0947, Tgr7_2463), while
hydrogen peroxide is converted to oxygen by hydroperoxidase
(TK90_0947, Tgr7_1107) or to H2O by cytochrome c peroxidase
(TK90_0812, Tgr7_2739). In addition, Thioalkalivibrio sp. K90mix
produces high concentrations of a specific membrane-bound yellow
pigment named 'natronochrome' [54]. The pigment has a high degree of
unsaturation and might also play a role in the protection against
ROS. The gene(s) responsible for the synthesis of this antioxidant
remain to be identified.



Osmotic stress 

Thioalkalivibrio sp. K90mix is an extremely salt-tolerant bacterium.
It can grow in saturated sodium and potassium carbonate and sodium
sulfate brines containing up to 4 M Na+/K+ but, in contrast to
haloalkaliphiles, it is inhibited by high concentrations of chloride.
A more appropriate term for such an extremophile would therefore be
"extreme natronophile". To withstand these extreme salinities, it
synthesizes glycine betaine as the main compatible solute; the genome
contains the genes for glycine sarcosine N-methyltransferase
(TK90_0179) and sarcosine dimethylglycine methyltransferase
(TK90_0180). In addition, the genome contains the gene for sucrose
phosphate synthase (TK90_2312) to produce sucrose as a compatible
solute. Genes for ectoine synthesis were not found.